The ability to identify jets containing B hadrons is important for the high-$p_T$ physics program of a general-purpose
experiment such as ATLAS. b-tagging is in particular useful for selecting very pure top quark samples, for studying
standard model or supersymmetric Higgs bosons, which couple preferentially to heavy objects, for vetoing backgrounds
in several physics channels, and finally for searching for new physics beyond the standard model, such as supersymmetric
particles or heavy gauge bosons. After a review of the algorithms used to identify b-jets, their anticipated performance
is discussed, as well as the impact of selected critical ingredients such as residual misalignments in the tracker. The
prospects for measuring the b-tagging performance in the first few hundred $\mathrm{pb}^{-1}$ of data with di-jet events
as well as $t\bar{t}$ events are presented. Finally, three different physics use cases of b-tagging are summarised.

1. b-TAGGING ALGORITHMS AT ATLAS

Bottom jets possess several characteristic properties that can be utilised for separating them from jets coming from 
the hadronisation of lighter quarks. The most important property is the relatively long lifetime of B hadrons of the 
order of 1.5 ps, which leads to a measurable flight path length of typically a few millimeters before the subsequent
decay into lighter hadrons. A significant fraction of B hadrons decay semi-leptonically into hadrons containing charm 
quarks, which then decay further into lighter hadrons. The decay of the B hadrons at a displaced secondary vertex
can be identified inclusively by measuring the impact parameters of tracks coming from the decay, that is the distance
from the point of closest approach of the track to the interaction vertex. The impact parameter itself is a signed 
quantity, where it is chosen positive if the point of closest approach lies upstream with respect to the jet direction and 
negative in the other case. Apart from that, the secondary vertex can be reconstructed explicitly, either inclusively 
or exclusively, the latter implying a full reconstruction of the B hadron decay chain. In addition the relatively large 
mass of more than 5 GeV produces opening angles and transverse momenta of the decay products relative to the 
B hadron flight direction that differ significantly from other jets. The various tagging methods studied at ATLAS 
can be divided into two main classes. The spatial taggers encompass methods that utilise lifetime information like 
impact parameters and decay vertices. The soft-lepton taggers on the other hand are based on the reconstruction 
of the lepton in case the B hadron decayed semi-leptonically. These non-isolated leptons have a sizable transverse
momentum as well as a large transverse momentum relative to the jet axis, $p_T^{\mathrm{rel}}$. The first property comes
from the fact that the b quark fragmentation is hard, that is, the B hadron retains a major fraction of the original
b quark momentum. Detailed information on the presented results and b-tagging at ATLAS in general can be found in the
chapter on b-tagging in [1] and references therein.
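
The signed impact parameter described above can be sketched in a few lines. This is only an illustration, not ATLAS code: the straight-line track model, the function name `signed_impact_parameter`, and the reading of the sign convention (positive when the point of closest approach lies in the hemisphere into which the jet points, as seen from the primary vertex) are assumptions made for the example.

```python
import math

def signed_impact_parameter(track_point, track_dir, vertex, jet_dir):
    """Signed 3D impact parameter of a straight-line track (illustrative).

    track_point: any point on the track, track_dir: track direction,
    vertex: primary vertex, jet_dir: jet axis direction (all 3-tuples).
    """
    # Normalise the track direction.
    norm = math.sqrt(sum(c * c for c in track_dir))
    u = [c / norm for c in track_dir]
    # Vector from the vertex to the reference point on the track.
    d = [p - v for p, v in zip(track_point, vertex)]
    # Subtract the component along the track; what remains points from
    # the vertex to the point of closest approach.
    along = sum(a * b for a, b in zip(d, u))
    closest = [a - along * b for a, b in zip(d, u)]
    magnitude = math.sqrt(sum(c * c for c in closest))
    # Assumed sign convention: positive if the point of closest approach
    # lies on the jet side of the vertex, negative otherwise.
    projection = sum(a * b for a, b in zip(closest, jet_dir))
    return magnitude if projection >= 0 else -magnitude
```

With this convention, a track passing the vertex at unit distance on the jet side returns +1, and the same track evaluated against an oppositely directed jet returns -1.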

Apart from a few simple taggers, which are intended mainly for the early start-up phase of the experiment where one
prefers simple, robust and easy-to-calibrate taggers, all tagging methods rely on the likelihood ratio approach to build
a discriminating variable, called the jet weight, for the separation of b-jets, c-jets and other jets. For this purpose
one- or multi-dimensional reference histograms of discriminating variables are constructed from Monte Carlo simulation
for the different hypotheses. In some cases the weights of different taggers are combined to reach better discrimination.
In the following, only the separation between b-jets and light jets is considered for simplicity. All jets having a jet
weight above a certain cut value are then tagged as b-jets. The specific cut value determines the b-tagging efficiency
$\epsilon_b$, which is defined as the fraction of true b-jets that are tagged as b-jets, as well as the rejection of
light jets $R_u$. The latter is defined as the inverse of the fraction of true light jets that are falsely tagged as
b-jets. For a given cut on the weight, the rejection of light jets as well as the efficiency in general depend strongly
on $\eta$ and $p_T$ of the jet, see Figure 1.
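
The two definitions above translate directly into code. The following sketch assumes that jet weights and truth labels are already available as plain Python lists; the function name and the toy numbers are hypothetical and carry no physics meaning.

```python
def efficiency_and_rejection(b_weights, light_weights, cut):
    """Return (epsilon_b, R_u) for a given cut on the jet weight.

    b_weights: jet weights of true b-jets
    light_weights: jet weights of true light jets
    """
    tagged_b = sum(1 for w in b_weights if w > cut)
    tagged_light = sum(1 for w in light_weights if w > cut)
    # Efficiency: fraction of true b-jets that are tagged.
    eff_b = tagged_b / len(b_weights)
    # Rejection: inverse of the fraction of light jets that are mis-tagged.
    # It is infinite if no light jet passes the cut.
    if tagged_light == 0:
        return eff_b, float("inf")
    return eff_b, len(light_weights) / tagged_light

# Toy weights, invented for illustration only.
toy_b = [5.0, 3.0, 1.0, -1.0, 4.0]
toy_light = [-3.0, -2.0, 0.0, 2.0, -1.0, -4.0, -5.0, 1.5, -2.0, -1.0]
print(efficiency_and_rejection(toy_b, toy_light, cut=1.2))  # (0.6, 5.0)
```

Scanning the cut value and plotting the resulting pairs gives a rejection-versus-efficiency curve of the kind shown in the right panel of Figure 1.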

34th International Conference on High Energy Physics, Philadelphia, 2008

Figure 1: Left: b-tagging efficiency obtained with the IP3D+SV1 tagging algorithm operating at a fixed cut on the b-tagging
weight for $t\bar{t}$ events versus jet $p_T$ and jet $|\eta|$ respectively [1]. Right: Rejection of light jets versus b-jet
efficiency for $t\bar{t}$ events and for the following tagging algorithms: JetProb, IP2D, IP3D, IP3D+SV1, IP3D+JetFitter [1].

The simplest taggers tag a jet as a b-jet if either there are at least N tracks associated to the jet with an impact
parameter (IP) larger than a fixed value (TrackCounting), or the B hadron flight distance significance as calculated
from the secondary vertex position exceeds some cut value (SV0). Another tagger (JetProb) uses the negative
side of the (transverse) IP significance distribution as obtained from prompt tracks to calculate the probability 
of compatibility of the tracks in the jet with the primary vertex. This probability is then used as jet weight. 
As this tagger uses only information from prompt tracks it is expected to be simpler to calibrate than the more 
sophisticated likelihood ratio taggers, which in addition use reference distributions for b-jets. A more sophisticated
impact-parameter-based tagger that uses likelihood ratios, called IPxD, utilises the distributions of the IP significance
as calculated in the longitudinal projection (IP1D), the transverse projection (IP2D) or a combination of both in
two-dimensional histograms (IP3D). Both IPxD and JetProb divide all tracks into categories with dedicated reference 
distributions before calculating the jet weight to improve the performance. 
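
As an illustration of the likelihood ratio approach, the sketch below builds a jet weight from one-dimensional reference histograms of the IP significance. It assumes that the weight is the sum over the tracks of the logarithm of the ratio of the b-jet and light-jet reference probabilities; track categories are ignored, and all function names, the binning and the probability floor are hypothetical choices made for the example.

```python
import math

def find_bin(value, edges):
    """Bin index for value; underflow and overflow are clamped to the edge bins."""
    if value < edges[0]:
        return 0
    for i in range(len(edges) - 1):
        if value < edges[i + 1]:
            return i
    return len(edges) - 2

def make_reference(values, edges, floor=1e-6):
    """Normalised reference histogram, with a floor to avoid log(0)."""
    if not values:
        raise ValueError("need at least one entry to build a reference")
    counts = [0] * (len(edges) - 1)
    for v in values:
        counts[find_bin(v, edges)] += 1
    return [max(c / len(values), floor) for c in counts]

def jet_weight(significances, ref_b, ref_light, edges):
    """Sum of per-track log-likelihood ratios (assumed form of the weight)."""
    weight = 0.0
    for s in significances:
        i = find_bin(s, edges)
        weight += math.log(ref_b[i] / ref_light[i])
    return weight
```

A jet whose tracks populate bins that are more probable under the b-jet hypothesis receives a positive weight, and a jet with prompt-like tracks a negative one.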

One secondary-vertex tagger (SV1/2) fits inclusive secondary vertices and builds the jet weight from several one-
or multi-dimensional variable distributions, such as the vertex mass, the ratio of the energy of the tracks fitted to
the vertex to that of all tracks in the jet, the number of tracks in the vertex, and the angle between the jet direction
and the B hadron flight direction as estimated from the primary-secondary vertex axis. In addition, the resulting jet
weight is combined with the IP3D
tagger. The tagger with the best performance (JetFitter) fits the decay chain of B hadrons, i.e. it fits a common 
B/C flight direction along with the position of additional vertices on it. This even includes incomplete topologies 
where no vertex could be fitted. The jet weight is calculated similarly to SV1/2, but taking different decay topologies 
into account via categories. The JetFitter algorithm can also be combined with IP3D for optimal performance. 

Two soft-lepton tagger approaches are pursued at ATLAS. One uses soft muons and one- or two-dimensional
reference histograms of the muon $p_T$ and the muon $p_T^{\mathrm{rel}}$. The other uses electrons and relies on the
challenging identification of soft electrons inside jets.
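
The transverse momentum of the lepton relative to the jet axis can be computed from the two momentum vectors alone. The sketch below is a generic vector calculation, not the ATLAS implementation; in particular, whether the jet axis is defined with or without the lepton momentum included is left open, and the function name `pt_rel` is hypothetical.

```python
import math

def pt_rel(lepton_p, jet_axis):
    """Component of the lepton momentum perpendicular to the jet axis.

    lepton_p: lepton momentum (px, py, pz) in GeV
    jet_axis: jet direction (any non-zero length)
    """
    axis_norm = math.sqrt(sum(c * c for c in jet_axis))
    # Longitudinal component along the jet axis.
    p_parallel = sum(a * b for a, b in zip(lepton_p, jet_axis)) / axis_norm
    p_squared = sum(c * c for c in lepton_p)
    # Guard against tiny negative values from rounding.
    return math.sqrt(max(p_squared - p_parallel * p_parallel, 0.0))

print(pt_rel((3.0, 0.0, 4.0), (0.0, 0.0, 1.0)))  # 3.0
```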



2. PERFORMANCE IN MONTE CARLO SIMULATIONS 

The performance of the tagging methods is estimated on Monte Carlo simulated events. For the following results
$t\bar{t}$ events were used, where jets were reconstructed via a seeded cone algorithm based on calorimeter towers with a
jet cone radius of $\Delta R = 0.4$. A true b-jet is then defined as any reconstructed jet for which there exists at least
one b quark with $p_T > 5$ GeV in a cone of $\Delta R = 0.3$ around the jet axis. Light jets are jets to which neither a
b quark, nor a c quark, nor a tau lepton could be matched in this way. Furthermore, all jets have to pass the preselection
criteria $p_T > 15$ GeV and $|\eta| < 2.5$. Tracks are also associated to jets via a $\Delta R$ criterion, with a default
value of $\Delta R = 0.4$. A snapshot of the expected light jet rejection as a function of the efficiency can be found for
the spatial taggers in Figure 1. One can expect rejections ranging from $R_u \approx 30$ for the JetProb tagger up to
$R_u \approx 200$ for the sophisticated JetFitter at a typical b-tagging efficiency of $\epsilon_b = 60\%$. The soft muon
tagger, for example, gives $R_u \approx 300$ at $\epsilon_b = 10\%$, where $\epsilon_b$ includes branching ratios. A
separation of charm jets and b-jets is much more difficult due to the very similar properties of both types of jets,
leading to a rejection that is typically a factor of $\approx 10$ smaller than for light jets. Since the JetFitter
algorithm deals with the actual topology of the decay, it handles c-jets better than the other methods, reaching a charm
rejection of $\approx 20$. It is interesting to note that b-tagging is possible at trigger level, with a light jet
rejection of $\approx 20$ at 60% efficiency at the last trigger stage.

An effort has been made to reach a realistic understanding of critical aspects of b-tagging. The studies summarised
above were done assuming a perfect knowledge of all misalignments. More realistic studies take residual
misalignments into account, as well as the process of realignment including systematic errors. Recent studies indicate
a possible degradation of the light jet rejection of at most $\approx 25\%$ for fixed $\epsilon_b$, but not all
systematic effects have been taken into account yet. Further degradations in rejection are seen in studies including
pile-up events ($\approx 5$ minimum bias events are expected at an instantaneous luminosity of
$2 \times 10^{33}\,\mathrm{cm}^{-2}\,\mathrm{s}^{-1}$), where in a few percent of the cases a wrong primary vertex is
reconstructed, leading among other things to an artificial shift in the longitudinal IP and finally to a loss in
rejection of $\approx 20$-$30\%$ for IP3D and IP3D+SV1. Studies are being carried out to recover the lost performance by
optimising the b-tagging for the case of pile-up. The taggers that take only the transverse impact parameter into
account are, however, not affected by pile-up.
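
The order of magnitude of the pile-up quoted above can be cross-checked with a short calculation of the mean number of interactions per bunch crossing, the luminosity times the cross section divided by the crossing rate. Only the luminosity comes from the text. The inelastic cross section of 70 mb and the nominal LHC filling scheme (2808 colliding bunches at a revolution frequency of 11245 Hz) are assumptions of this sketch, so the result should be read as a rough consistency check only.

```python
MB_TO_CM2 = 1e-27  # 1 millibarn in cm^2

def mean_interactions_per_crossing(
    luminosity_cm2_s,
    sigma_inel_mb=70.0,     # assumed inelastic cross section
    n_bunches=2808,         # assumed nominal number of colliding bunches
    f_revolution_hz=11245,  # assumed LHC revolution frequency
):
    """Mean number of inelastic interactions per bunch crossing."""
    interaction_rate = luminosity_cm2_s * sigma_inel_mb * MB_TO_CM2
    crossing_rate = n_bunches * f_revolution_hz
    return interaction_rate / crossing_rate

mu = mean_interactions_per_crossing(2e33)
print(round(mu, 1))  # between 4 and 5 for the assumed inputs
```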

3. PROSPECTS FOR PERFORMANCE MEASUREMENTS 

It will be necessary to calibrate the b-tagging methods on data. This means that one has to measure the tagging
efficiency as well as the mis-tagging efficiency, i.e. the inverse of the rejection. Due to the dependence on $p_T$ and
$\eta$ it is desirable to perform the calibration in bins of those variables. In addition one would also like to measure
the reference histograms used by the taggers with data. For the efficiency measurement several methods have been studied
at ATLAS that make use of either di-jet or $t\bar{t}$ events.

The $p_T^{\mathrm{rel}}$ method uses events with jets that include non-isolated muons. Templates of the muon
$p_T^{\mathrm{rel}}$, as obtained from simulated and reconstructed b-, c- and light jets passing basic selection
criteria, are fitted to the measured distribution before and after applying the respective tagger, which preferably is a
spatial tagger. By counting the number of muonic b-jets before and after the tagging, the efficiency $\epsilon_b$ can be
estimated. Another method that uses muonic jets is called System 8. This method needs two event samples with different
flavor composition, preferably enriched with b-jets, as well as two uncorrelated tagging methods. For the first sample
one selects jets with non-isolated muons, whereas the other is a subsample with one additional b-tagged jet in the
opposite hemisphere. For the taggers one can use a spatial one and a soft lepton tagger. By solving eight equations with
eight unknowns one is able to determine the b-tagging efficiency. A special trigger for muonic jets was implemented to
collect the necessary number of events, and by using different trigger thresholds and prescaling it is possible to use
these methods in bins of $p_T$ and $\eta$ with a fixed amount of data. It was shown that for both methods an integrated
luminosity of $\sim 50\,\mathrm{pb}^{-1}$ is sufficient to derive detailed calibration curves in $p_T$ or $\eta$. The
relative precision of the absolute value of $\epsilon_b$ is estimated for both methods to be at the 6% level, which is
limited by systematic errors.
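
For orientation, the eight equations of System 8 can be written out in a simplified form. The notation below is an assumption of this sketch and is not taken from [1]: $n$ and $p$ denote the muon-jet sample and the subsample with an opposite-side b-tag, $A$ and $B$ the spatial and the soft lepton tagger, and $cl$ the combined charm and light flavor component. All correlation factors, which in practice would be taken from simulation, are set to one here.

```latex
\begin{align}
n        &= n_b + n_{cl} \\
p        &= p_b + p_{cl} \\
n^{A}    &= \epsilon_b^{A}\, n_b + \epsilon_{cl}^{A}\, n_{cl} \\
p^{A}    &= \epsilon_b^{A}\, p_b + \epsilon_{cl}^{A}\, p_{cl} \\
n^{B}    &= \epsilon_b^{B}\, n_b + \epsilon_{cl}^{B}\, n_{cl} \\
p^{B}    &= \epsilon_b^{B}\, p_b + \epsilon_{cl}^{B}\, p_{cl} \\
n^{AB}   &= \epsilon_b^{A}\epsilon_b^{B}\, n_b + \epsilon_{cl}^{A}\epsilon_{cl}^{B}\, n_{cl} \\
p^{AB}   &= \epsilon_b^{A}\epsilon_b^{B}\, p_b + \epsilon_{cl}^{A}\epsilon_{cl}^{B}\, p_{cl}
\end{align}
```

The eight measured counts on the left-hand sides determine the eight unknowns $n_b$, $n_{cl}$, $p_b$, $p_{cl}$, $\epsilon_b^{A}$, $\epsilon_{cl}^{A}$, $\epsilon_b^{B}$ and $\epsilon_{cl}^{B}$. The products in the last two equations are where the requirement of uncorrelated taggers enters.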

There are several methods that make use of $t\bar{t}$ events. First, there is an event counting method that measures the
average b-tagging efficiency and the cross section of $t\bar{t}$ production in the lepton+jets or the dilepton channel at
the same time, by counting the number of events with 1, 2 or 3 tagged jets. With $\sim 100\,\mathrm{pb}^{-1}$ of data a
relative precision of $\sim (2.2\,(\mathrm{stat.}) \pm 3.5\,(\mathrm{sys.}))\%$ can be reached in the lepton+jets channel.
In order to measure the efficiency in bins of $p_T$ and $\eta$ one identifies b-jets by fully reconstructing the
$t\bar{t}$ decay chain in the lepton+jets channel. The classification is improved by requiring a b-tag for one jet coming
from the hadronically decaying top. The b-tagging is then studied on the other, unbiased side. Three methods are explored
for selecting such samples: a kinematic fit, a likelihood selection and a topological selection. With
$\sim 200\,\mathrm{pb}^{-1}$ of data a relative precision of $\sim (7.7\,(\mathrm{stat.}) \pm 3.2\,(\mathrm{sys.}))\%$ can
be reached, for example, for the topological method. With a larger amount of data the latter techniques could also be
used to extract the reference distributions for b-jets.
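
The idea behind the event counting method can be illustrated with a strongly simplified model. The sketch assumes that every selected event contains exactly two b-jets within acceptance, that the two jets are tagged independently with the same efficiency, and that there are no mis-tags and no background, so that events with three tags do not occur. None of these simplifications is claimed for the actual analysis, and the function name is hypothetical.

```python
def tag_counting(n_one_tag, n_two_tags):
    """Solve the simplified counting model for (efficiency, number of events).

    Model: N1 = 2 * eff * (1 - eff) * N  and  N2 = eff**2 * N,
    which gives N1 + 2 * N2 = 2 * eff * N.
    """
    if n_two_tags <= 0:
        raise ValueError("need at least one double-tagged event")
    total_tags = n_one_tag + 2 * n_two_tags
    eff = 2 * n_two_tags / total_tags
    n_events = total_tags ** 2 / (4 * n_two_tags)
    return eff, n_events

# Toy input generated with eff = 0.6 and N = 1000 (invented numbers).
print(tag_counting(480, 360))  # (0.6, 1000.0)
```

Because the efficiency and the number of produced events are extracted together from the tag multiplicities, the efficiency does not have to be taken from simulation, which is what allows a simultaneous cross section measurement.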

4. PHYSICS USE CASES

There are various examples of use cases where b-tagging is a critical ingredient. One is the search for a standard
model Higgs boson in the $t\bar{t}H(H \to b\bar{b})$ channel. Here b-tagging can be used to reduce or even eliminate large
backgrounds such as $t\bar{t}jj$ or W+jets. For example, the $t\bar{t}jj$ background is reduced by two orders of magnitude
by using b-tagging. With b-tagging the signal significance is estimated to be $S/\sqrt{B} \sim 2.2$ at an integrated
luminosity of $\sim 30\,\mathrm{fb}^{-1}$. Another example is the top mass measurement. There the highest precision can
be reached in the $t\bar{t}$ lepton+jets channel by requiring two b-tagged jets and using the hadronically decaying top as
the mass estimator. Assuming a jet energy scale uncertainty of the order of one percent, a precision of 1 GeV can be
reached with an integrated luminosity of $1\,\mathrm{fb}^{-1}$, at which point the measurement would already be dominated
by systematic errors. A complementary approach relying on b-tagging infers the top mass from the mean transverse decay
length of B hadrons coming from the top decays. Here the uncertainty due to the jet energy scale is negligible. Finally,
b-tagging is also important for exotic models, where one has to reconstruct very high-$p_T$ jets above 500 GeV. These jets
are challenging for tracking as well as for b-tagging, due to the high track density and multiplicity and due to the fact
that the B hadron decay lengths are so large that the secondary vertex can be located around or beyond the first detector
layers.
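
The growth of the decay length with momentum follows from the mean flight length L = (p/m) c tau. The sketch below uses the lifetime of 1.5 ps quoted in Section 1; the B hadron mass of 5.28 GeV, the example momenta and the 50 mm distance standing in for an innermost detector layer are assumptions chosen for illustration.

```python
import math

C_MM_PER_PS = 0.299792458  # speed of light in mm per picosecond

def mean_flight_length_mm(p_gev, mass_gev=5.28, lifetime_ps=1.5):
    """Mean flight length L = beta*gamma * c * tau, with beta*gamma = p/m."""
    return (p_gev / mass_gev) * C_MM_PER_PS * lifetime_ps

def fraction_decaying_beyond(distance_mm, p_gev, mass_gev=5.28, lifetime_ps=1.5):
    """Fraction of hadrons that fly further than distance_mm before decaying."""
    return math.exp(-distance_mm / mean_flight_length_mm(p_gev, mass_gev, lifetime_ps))

# A 50 GeV B hadron flies a few millimeters on average.
print(round(mean_flight_length_mm(50.0), 1))            # 4.3
# At 400 GeV a sizable fraction decays beyond an assumed 50 mm.
print(round(fraction_decaying_beyond(50.0, 400.0), 2))  # 0.23
```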

5. CONCLUSIONS 

Various algorithms for tagging b-jets have been studied in detail at ATLAS. The spectrum covers simple, robust
taggers as well as sophisticated taggers that make use of as much information as possible from the B hadron
decay chain. Several approaches for calibrating b-tagging algorithms with data were shown to be realisable with
a few $100\,\mathrm{pb}^{-1}$ of data. b-tagging is essential for many physics analyses, such as Higgs boson searches or
top mass measurements.